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ABSTRACT 

The acidic peadands of southern Brazil are ecosystems essential for the maintenance of the Atlantic Forest, 
one of the 25 hot-spots of biodiversity in the world. In this work, we investigated the composition of 
prokaryotic communities in four histosols of three acidic peatland regions by constructing small-subunit 
(SSU) rRNA gene libraries and sequencing. SSU rRNA gene sequence analysis showed the prevalence of 
Acidobacteria (38.8%) and Proteobacteria (27.4%) of the Bacteria domain and Miscellaneous (58%)) and 
Terrestrial (24%)) groups of Crenarchaeota of the Archaea domain. As observed in other ecosystems, 
archaeal communities showed lower richness than bacterial communities. We also found a limited number 
of Euryarchaeota and of known methanotrophic bacteria in the clone libraries. 
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INTRODUCTION 

Peatlands are ecosystems with high quantity of organic 
matter due to high water table levels and low temperatures. In 
the water-saturated layer, the lack of oxygen significantly 
retards degradation of organic matter, causing plant residues to 
accumulate as peat. Worldwide, it is estimated that an area of 
about 400 million ha is covered with peatland, equivalent to 3 
%o of the Earth's surface. Most of this area, about 350 million 
ha, is in the northern hemisphere, harbouring over one-third of 
the global reserves of soil organic carbon (44). In Brazil, it is 



estimated that these environment cover approximately 611,883 
ha, i.e., around 0.07 %o of the national territory (55). 

The acidic peatlands of the Parana state in southern Brazil 
are contained within the high-altitude grasslands that cover 
28,000 km^ of the state. This ecosystem contains herbaceous 
and bryophytes species and the soils are covered by Sphagnum 
mosses, the dominant component of the peat bog vegetation 
(15). This ecosystem belongs to the Brazihan Atlantic Forest 
and is essential for the maintenance of the biome, which is one 
of the 25 biodiversity hot-spot in the world (38). Occurring in 
altitudes higher than 1,000 meters above sea level, these 
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Sphagnum peat bogs are large reservoirs of carbon which filter 
and regulate the water flow of the main hydrographic basins of 
southem Brazil (Curcio et al., unpublished data). 

Sphagnum peat bogs are one of the oldest known 
landscape type in Brazil (3) and their unique environment for 
many plants and animals are also an extreme habitat for 
microorganisms due to their high acidity (pH 3.0-5.0), low 
temperature and low concentration of mineral nutrients (43). 
Due to the environmental and ecological significance of 
Sphagnum peat bogs in the global carbon budget and in the 
emission of methane, their microbial communities are of 
considerable interest. Up to now there is no record of 
microbiological studies of Sphagnum peat bogs in southem 
Brazil. In this study, we have used culture-independent SSU 
rRNA gene sequencing approach to determine the composition 
of the archaeal and bacterial communities in three acidic 
peatland regions from the southem Brazilian Atlantic Forest. 

MATERIALS AND METHODS 

Localization and site description 

Histosols were collected in Febraary 2008 from three 
different physiographic units from sites where the water table 
was at the peat surface level. In the Serra do Mar (25° 53.5'S; 
48° 57.5 'W), the samples were collected from a depth of 20 cm 
below of the water table (SM20); in the Second Plateau (25° 
16.4'S; 49°49.5'W) from depths 20 (CG20) and 50 cm (CG50) 
below the water table and in the Third Plateau (26° 34.4'S; 51° 
41.7'W) from 20 cm below the water table (PA20). The three 
chosen areas are preserved Sphagnum peat bogs from the 
southem Brazilian Atlantic Forest. Differences in the landscape 
of these ecosystems are related to the use of their surrounding 
areas. Adjacent to the Serra do Mar peatland is the upper 
mountain Atlantic Rainforest; to the Second Plateau, are 
pasture fields and to the Third Plateau, agricultural crops. In 
these areas the climate is temperate and humid without major 
dry periods. The annual precipitation average is between 1,400 



and 2,200 mm and the annual temperature ranges mainly 
between 12 and 18°C (3). The acidic peatland histosol is 
dominantly covered with Sphagnum moss, containing species 
of the Poaceae and Cyperaceae and small shmbs of Asteraceae, 
Melastomataceae, Ericaceae, Eriocaulaceae and Verbenaceae 
families. Ten histosol samples were randomly collected from 
an area of approximately 400 m^ (20m x 20m) and mixed into a 
composite sample of each site. From each sample, 
approximately 100 grams of soil were transferred to Falcon 
tubes and kept on ice for DNA extraction. The rest of the 
samples was collected in sealed plastic bags, stored at room 
temperature and sent for chemical analysis to the Department 
of Soil and Agricultural Engineering of the Federal University 
of Parana (UFPR, Brazil) (Table 1). 

Histosol DNA extraction and quantification 

One gram of histosol was used for nucleic acids extraction 
with the "Ultraclean Soil DNA kit" (MO BIO Laboratories) 
according to the manufacturer's instmctions. The DNA 
integrity was determined by electrophoresis in IX TAE-0.8% 
agarose gels and stained with ethidium bromide (0.5 )ag/mL). 

PCR amplification 

Bacterial and archaeal SSU rRNA genes were amphfied 
from 50 ng of soil DNA by PCR. Prior to the PCR, samples 
were incubated with 0.6 ng of BSA per ng of DNA in a total 
volume of 1 1 ^1 for 45 min at 37°C to scavenge PCR inhibiting 
substances. After chilling on ice, in a PCR final volume of 50 
^1, the following reagents were added: Ix PCR buffer 
(Invitrogen), 0.2 ^M of each primer, 2 mM MgCl2, 0.2 mM 
dNTP (Invifrogen), 0.3 ng BSA and 1 U Taq DNA polymerase 
(Invitrogen). Amplifications were performed in a Mastercycler 
gradient thermocycler (Eppendorf). The primers 27F (5'AGAG 
TTTGATCMTGGCTCAG) and 1492R (5'GGTTACCTTGTT 
ACGACTT) (51) were used for the amphfication of bacterial 
SSU rRNA and the primers 2 IF (5' TTCYGGTTGATCCYGC 
CRGA) and 958R (5' YCCGGCGTTGANTCCAATT) (37) 
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were used for the amplification of the archaeal SSU rRNA 
gene. The following conditions were used for PCR with both 
primer sets: 3 min at 95 "C; 25 cycles of 45 s at 95''C, 45 s at 
55°C and 1.5 min at 12°C; and a final extension for 10 min at 
72°C. The quantity and quality of the amplification products 
were verified by gel electrophoresis in IX TAE-1% agarose 
and analysed with the LabWorks™ 4.0 Imaging Analysis and 
Acquisition software (UVP Inc.). 

SSU rRNA gene cloning and sequencing 

Approximately 75 ng of SSU rRNA gene amplicons were 
ligated to 50 ng of pGEM-T easy vector (Promega), according 
to the manufacturer's instructions. The ligation product was 
transformed into Escherichia coli DH5a competent cells. Eight 
clone libraries were constructed: four contained bacterial SSU 
rRNA gene inserts (SM20Bac, CG20Bac, CG50Bac and 
PA20Bac) and four contained archaeal SSU rRNA sequences 
(SM20Arq, CG20Arq, CG50Arq and PA20Arq). Plasmids 
were extracted by the alkaline lysis method in 96 well plates. 
Sequencing was performed using 250 ng of plasmid DNA, 3.25 
pmol of primer (27F for Bacteria and 21F/958R for Archaea), 
3 nL of ET terminator mix (GE Healthcare) and ultrapure H2O 
to a final volume of 7.5 )aL, in 1 cycle of 1 min at 95°C; 35 
cycles of 20 s at 94°C and 2 min at 62°C. Products were 
precipitated with three volumes of absolute ethanol, washed 
with 70% ethanol, dried and dissolved in deionized formamide. 
Sequencing was performed using a MegaBACE 1000 
sequencer (GE Healthcare). Reads were trimmed for the 
removal of low quality bases using the Phred program (19). For 
the Archaea domain, the Phrap program (26) was used to 
assemble the reads from both primers used for sequencing into 
the SSU rRNA partial gene sequence and the Consed program 
was used to view and edit the sequence assembly (25). The 
obtained SSU rRNA gene sequences were deposited in the 
GenBank database under the accession no. HQ6 13951 - 
HQ614492 (archaeal sequences) and HQ614493-HQ615057 
(bacterial sequences). 



Sequence analyses 

SSU rRNA gene sequences were aligned and analyzed 
with RDP tools (release 10) (14) and with the Mothur program 
(47). The chimeras, if detected, were removed with 
Bellerophon using the Huber-Hugenholtz correction. Chimera 
Check or Pintail using as template the Greengenes database for 
Bacteria and Silva database for Archaea. Alignments were 
edited manually or with GBlocks (9). Nucleotide substitution 
models were selected by using PAUP* version 4.0b 10 (52) and 
ModelTest (41). Sequences were clustered into OTU 
(Operational Taxonomic Unit - identity > 97%). Phylogenies 
were constructed with the neighbor-joining algorithm by using 
the program MEGA version 4.0 (53). The similarity in the 
composition of the clone libraries was examined by two 
methods: 1-LibShuff (46) and R (42) programs, using an 
abundance matrix of OTUs. Renyi diversity profiles to rank 
prokaryotic communities according to diversity (33). The chi- 
square test was used to verify differences in the frequencies of 
the phyla in samples. 

RESULTS AND DISCUSSION 

Bacterial and archaeal communities from acidic peatlands 

In this work we identified the major bacterial and archaeal 
groups present in acidic peatland from the southern Brazilian 
Atlantic Forest by SSU rRNA gene sequencing from the Serra 
do Mar (SM20), the Second Plateau (CG20 and CG50) and the 
Third Plateau (PA20). 565 sequences of the bacterial SSU 
rRNA V2-V3 region with an average length of 450pb were 
recovered. The majority of the sequences belong to the 
Acidobacteria (33.8%) and Proteobacteria (26.2%) phyla (Fig. 
lA). These two groups were the most numerous found in both 
forest soil and in Sphagnum peat bogs (17, 29). Four bacterial 
clones of Proteobacteria were related to methanotrophs 
(Methylococcaceae and Methylocystaceae families), with only 
one sequence showing 96% similarity with the genus 
Methylocystis. From the three studied areas, only the Third 
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Plateau (Fig. lA, PA20Bac) contained a higher number of 
representatives of Proteobacteria than Acidobacteria. Since the 
ProteobacterialAcidobacteria ratio has been correlated to the 
soil trophic status (27, 49), PA20 probably diverged from the 
other samples in this characteristic because of the anthropic 
interference caused by agriculture in the surrounding areas. The 
peatland of the Second Plateau, showed the highest content of 
organic matter (Table 1) and the highest abundance of 
Acidobacteria (Fig. lA). The relationship between the 
prevalence of Acidobacteria and soil composition can be 
explained in part by their particular metabolism, since the 
acidophillic Acidobacteria are oligotrophs, characterized by 
slow growth rates and metabolism of more refractory carbon 
substrates, a characteristic of peat soils (18). 

The composition of the dominant groups in the Second 
Plateau site changed with depth of sample collection, and the 
Ktedonobacter genus (Bacteria_incertae_sedis phylum), which 
includes spore-forming, gram-positive filamentous bacteria 
(10), was the second most abundant group at the 50 cm depth 
(Fig. lA). An increase in the abundance of gram-positive 
bacteria with depth in a soil profile was also observed by Fierer 
et al. (22). However, cluster analysis by Bray-Curtis distances 
(Fig. 2B) and j-Libshuff analysis (P>0.05) revealed that the 
samples from depths of 20 and 50 cm of the same peatland 
(CG20Bac and CG50Bac libraries) are the most similar 
communities. These soil samples had very similar chemical 
composition (Table 1) which probably explains the high 
similarity of the two libraries. 

Phylogenetic analysis of dominant groups showed that 
representatives of the Acidobacteria (Gpl, Gp2 and Gp3) and 
of the classes of Proteobacteria {Alpha, Beta, Gamma and 
Delta) were present in all samples, contributing to the high 
similarity of the libraries (Fig. SI and S2). The taxa responsible 
for the observed differences belonged mainly to the least 
abundant phyla: Firmicutes, Verrucomicrobia, Bacteria_ 
inceriae_sedis, Bacteroidetes, Chloroflexi and Actinobacteria 
(Fig. lA). 



One study of a Sphagnum peat bog in Western Siberia 
showed that Verrucomicrobia was the third most abundant 
phylum (17) and other studies based on SSU rUNA sequencing 
suggested that Verrucomicrobia represent one of the most 
dominant phyla of soil environments (29, 32). They typically 
represent between 2% and 8% of the total bacterial community, 
with the highest densities observed in the rhizosphere (32). 
However, no representative of this phylum was found in the 
peatland of Serra do Mar (SM20Bac library) (Fig. lA). This 
was the only peatland area surrounded by the Atlantic 
Rainforest. A recent study of soil of the Atlantic Rainforest 
also found no representatives of this phylum (20). 

Distribution analysis showed a high percentage of unique 
OTUs in all four bacterial libraries varying from 58 to 80% 
(Table 1). Only two OTUs belonging to the Gpl and Gp2 
genera were present in all libraries. Representatives of the class 
Deltaproteobacteria were also present in all pairwise 
comparisons (Table 2). 

A total of 542 sequences of archaeal SSU rRNA spanning 
V2-V5 region with an average length of 800 bp were also 
obtained. In contrast to that observed for peatlands of the 
northern hemisphere (8, 30), methanogens do not appear to 
dominate archaeal communities of the southern Brazilian 
Atlantic Forest peatlands. In these ecosystems 90% of the 
sequences belonged to the Crenarchaeota phylum, and only 
9%) were from methanogens, which belonged to the 
Euryarchaeota phylum (Fig. IB). 

Two groups were dominant within the phylum 
Crenarchaeota: Miscellaneous Crenarchaeotic Group (MCG, 
58%) and Terrestrial Group (TG, 24%) (Fig. IB). MCG has a 
wide habitat range, which includes terrestrial and marine, hot, 
cold, surface or subsurface environments (54). The presence of 
MCG in the SM20, CG20, CG50 and PA20 peatlands confirms 
the wide distribution of this group. The TG contains 
crenarchaeotal sequences that were mainly retrieved from soil 
samples and the rhizosphere (40, 48). The wide distribution of 
TG in soil indicates that it is one of the most globally 



664 



Etto, R.M. et al. 



Prokaryotic communities of acidic peatlands 



distributed archaeal group in terrestrial habitats. However, the 
ecophysiology and function of Crenarchaeota remain unclear 
due to the lack of cultured isolates. 

As observed for Bacteria, the archaeal communities 
detected in samples from different depths of the same peatland 
(CG20Arq and CGSOArq libraries) were the most similar, but 
in contrast to the bacterial libraries, the SM20Arq and 
PA20Arq archaeal libraries exhibited the highest similarity 
(Fig. 2C). Phylogenetic analysis of crenarchaeotal groups 
showed that the representatives of MCG were responsible for 
the high similarity of the SM20Arq and PA20Arq libraries 
(Fig. S3). The four libraries shared two OTUs of MCG and TG 
(Table 2). 

Soil depth seemed to influence the distribution of the 
archaeal taxa. The archaeal community from the sample 
collected at 50 cm showed a lower percentage of unshared 
OTUs (40.8%) than that from 20 cm (66.7% of unique OTUs) 
(Table 1). 

In this study, the SSU rRNA gene sequence of the 
following methanogens were found: Rice Cluster-I (RC-I), 
Rice Cluster-II (RC-II), Methanosaeta, Gulf of Mexico Arc I 
group (GOM Arc I), Methanomicrobiaceae, Candidatus 
Methanoregula and Methanocella (Fig. 3). Parwise 
comparisons between libraries indicated that SM20Arq and 
PA20Arq shared one OTU of the Methanomicrobiaceae family 
and one of the Rice Cluster II group; CG20Arq and PA20Arq 
shared one OTU of the GOM ArcI group; and CGSOArq and 
PA20Arq shared one OTU of the Rice Cluster II (Table 2). No 
methanogenic OTU was shared between the libraries CG20Arq 
and CGSOArq or SM20Arq and CG20Arq. 

The RC-I and RC-II groups have been found in different 
ecosystems with active methanogenesis, such as acidic 
peatlands and rice paddies (8, 12, 24). Members of the RC-I 
group, found only in the Second Plateau (CG20Arq and 
CGSOArq libraries) (Fig. 3), are characterized as 
hydrogenotrophic methanogens and have been found to 
produce methane near rice roots (4S) by using the accumulated 



H2, generated from the fermentation of secreted photosynthates 
(16). This group is particularly abundant in rice paddies, where 
it can make up 20 to S0% of the total number of methanogens 
(34). However it represented only 4% of the methanogens in 
our area of study, which is similar to other peatlands (7). RC-II 
was the most abundant methanogen group with representatives 
in the PA20Arq, SM20Arq and CGSOArq libraries (Fig. 3). 
Members of RC-II seem to tolerate a broad range of pH values, 
since sequences have been retrieved from acidic as well as 
neutral sites (2, 12,31). 

Sequences clustering with Methanoregula spp. were 
present in libraries SM20Arq and PA20Arq (Fig. 3). These are 
likely to represent hydrogenotrophic and/or formatotrophic 
methanogens, since members of this group are known to utilize 
H2/CO2 (4) or H2/CO2 or formate (57). This group is abundant 
in other oligotrophic peatlands (23, 50), and has been shown to 
grow in low-nutrient and low-ionic-strength medium and to be 
highly sensitive to oxygen (5), suggesting its adaptation to 
extreme anaerobic environments. Other sequences of the 
hydrogenotrophic orders Methanomicrobiales and 
Methanocellales were also found. 

Taxa of the group GOM Arc I (36) present in PA20Arq, 
CG20Arq and CGSOArq (Fig. 3) were closest to uncultured 
archaea of sediment and peatland (6, 7, 39). Although this 
group is phylogenetically close to the group ANME-2 
(Archaeal ANaerobic MEthane oxidizers, 36), there is no 
evidence of methane-oxidation by members of the GOM arc I. 

The sequence related to acetoclastic Methanosaeta was 
found only in PA20Arq and comprised 11% of the total 
methanogens (Fig. 3). The absence of other acetotrophs 
belonging to the family Methanosarcinaceae in this peatland 
suggests a low concentration of acetate in this bog, since in 
ecosystems where acetate concentrations are high 
Methanosaeta are outcompeted by Methanosarcina (13, 21, 
56). 

In agreement with Aller and Kemp (1), who compared 
bacterial and archaeal richness of several environments and 
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observed that in the majority of the studied ecosystems the 
bacterial richness was higher than archaeal richness, the 
southern Brazil acidic peatlands also had a higher richness of 
Bacteria than Archaea (Fig. 4). Besides, the Third Plateau 
(PA20) peatland showed higher procariotic diversity among the 
samples studied (Fig. 2A). 

In conclusion, the predominance of hydrogenotrophic 



methanogens (89% of the methanogens) suggests that the main 
production of CH4 occurs through the H2-CO2 pathway. This 
result is consistent with other studies, which showed that 
hydrogenotrophy is the dominant reaction in bogs (11, 23, 28, 
35). Furthermore, this is also the first study to reveal the 
prokaryotic diversity in the southern Brazilian Atlantic Forest 
peatlands, showing a higher richness of Bacteria than Archaea. 
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Figure 1. Taxonomic composition of the prokaryotic communities. Sequences of (A) Bacteria and (B) Archaea domains were 
compared with Ribosomal Database Project II and Silva Database respectively. Significant differences with respect to phyla 
frequencies per sample were observed in a Chi-square test. 
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Figure SI. Phylogenetic relationships among acidobacterial OTUs and their areas of origin. Each circle represents a different 
OTU and the number after its ID indicates the number of sequences that compose this OTU. Each colour represents a different 
area as shown in the picture. The bar indicates the proportion of sequences of each genus. Phylogenies were estimated by neighbor 
joining with the TrN model and y correction (a = 0.4882). Tree is unrooted. The scale bar represents changes per nucleotide. 
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Figure 2. Renyi diversity profiles and similarity dendograms based in OTU. (A) Comparison of the Renyi diversity profiles of the peatland 
prokaryote communities. The profile values for alpha = 0, 1,2 and infinity indicate the species richness, Shannon diversity index, logarithm of 
the reciprocal Simpson diversity index and Berger Parker diversity index, respectively. If the profile for one sample was consistently higher than 
the profile for another sample, the sample with the higher profile was considered more diverse. When curves for two communities intersect, this 
means that they cannot be ranked. (B) and (C) show an average linkage clustering dendrogram based on the Bray-Curtis distances among 
bacterial and archaeal communities, respectively. 



Table 1. Distribution of OTUs and soil chemical attributes of the samples 



Sample 


Bacterial OTU 
Unshared Shared 


Archaeal OTU 
Unshared Shared 


pH 

CaCl, 


g/kg 
C 


g/kg 
OM 


SM20 


69 (80.2%) 


17(19.8%) 


43 (74.1%) 


15 (25.9%) 


3.20 


256.11 


441.54 


CG20 


48 (60.7%) 


31 (39.3%) 


46 (66.7%) 


23 (33.3%) 


3.80 


265.81 


458.50 


CG50 


37 (58.7%) 


26 (41.3%) 


22 (40.8%) 


32 (59.2%) 


3.80 


267.01 


460.10 


PA20 


136 (80%) 


34 (20%) 


61 (69.3%) 


27 (30.7%) 


4.00 


172.57 


297.51 



C= carbon; 0M= organic matter. 



Table 2. Prokaryotic OTUs shared among the libraries. The sequences that compose the bacterial and archaeal OTUs were compared with 



Ribosomal Database Project II and Silva database respectively. The number of OTUs that compose each taxon are shown in parentheses. 



Libraries 


Bacteria shared 


Areiiaea sliared 


SM20 CG20 


Gpl (3), Gp2 (4), Gammaproteobacteria (1), Deltaproteobacteria (1), Proteobacteria (3) and 
Bacteria (1). 


Ten-estrial Group (3) and MCG (4). 


SM20 CG50 


Gp3 (1), Gp2 (2), Gpl (2) and Deltaproteobacteria (1). 


Terrestrial Group (3), Rice Clusterll (2) and 
MCG (5). 


SM20 PA20 


Deltaproteobacteria (1), Gp3 (1), Gp2 (2), Gpl (2) and Bacteria (2). 


Ten'cstrial Group (2), Methanomicrobiaceae 
(1), Rice Clusterll (1) and MCG (6). 


CG20 CG50 


Ktedonobacter {\), Betaproteobacteria (1), Deltaproteobacteria (2), Gpl3 (1), Gpl (3), Gp2 (4) 
and Bacteria (1). 


MCG (7), Thermoprotei (1) and Terrestrial 
Group (10). 


CG20 PA20 


Anaerolineales (1), Nitrospira (1), Gp3 (1), Gp2 (2), Gpl (4), Subdivisions genera incertae sedis 
(1), Betaproteobacteria (1), Deltaproteobacteria (2), Proteobacteria (2) and Bacteria (3). 


Terrestrial Group (1), GOM Arcl (1) and 
MCG (10). 


PA20 CG50 


Subdivisions genera incertae sedis (1), Ktedonobacter (1), Desulfuromonadales (1), 
Deltaproteobacteria (1), Gp3 (1), GplS (1), Gpl (3), Gp2 (5) and Bacteria (2). 


Ten'cstrial Group (3), Rice Clusterll (1), 
Thermopotei (1) and MCG (13). 


SM20 CG20 CG50 


Gp2 (2), Gpl (1) and Deltaproteobacteria (1). 


Terrestrial Group (3) and MCG (3). 


SM20 CG20 PA20 


Proteobacteria (1), Gp2 (1) and Gpl (1) 


Terrestrial Group (1) and MCG (4) 


SM20 CG50 PA20 


Gp2(\)andGpl (1). 


Terrestrial Group ( 1 ) Rice Clusterll ( 1 ) and 
MCG (3). 


CG20 CG50 PA20 


Deltaproteobacteria (1), Gp2 (2) and Gpl (2). 


Ten-estrial Group (1) and MCG (6). 


SM20 CG20 CG50 PA20 Gp2 (1) and Gpl (1). 


MCG (3) and Tenestrial Group (1). 
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Figure S2. Phylogenetic relationships among proteobacterial OTU and their areas of origin. Each circle represents a different OTU and the number 
after its ID indicates the number of sequences that compose this OTU. Each colour represents a different area as shown in the picture. The bar 
indicates the proportion of sequences of each class. Phytogenies were estimated by neighbor joining with the TrN model and y correction (a = 
0.6241). The tree is unrooted. The scale bar represents changes per nucleotide. 
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Figure 3. Phylogenetic relationships among methanogenics OTUsand their areas of origin. Each circle represents a different OTUand the number 
after its ID indicates the number of sequences that compose this OTU. Each colour represents a different area as shown in the picture. The bar 
indicates the proportion of sequences in each group. Sequences were compared with the NCBI and Silva database for the construction of the tree. 
Phylogenies were estimated by neighbor joining with the TrN model and y correction (a =1.0852). The tree was rooted using Thermofilum pendens 
and Thermoproteus tenax as an outgroup. The scale bar represents changes per nucleotide. 
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Figure S3. Phylogenetic relationships among crenarcheotal sequences and their origin. Each colour represents a different area as shown in 
the picture. Sequences were classified using Silva database. Phylogenies were estimated by neighbor joining with the LogDet. Tree is 
unrooted. The scale bar represents changes per nucleotide. Each symbol represents a different area. Clades of the Terrestrial Group (TG) and 
Micellaneous Crearchaeota Group (MCG) are indicated on the tree. 
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Figure 4. Rarefaction curve analyses of bacterial and archaeal SSU rRNA gene sequences across all sites. Bacterial sequences are 
represented by light gray diamonds and archaeal by dark gray squares. 
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